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1 A semisupervised learning method to merge search engine results 
Luo Si , Jamie Callan 

ACM Transactions on Information Systems (TOIS) October 2003 
Volume 21 Issue 4 

The proliferation of searchable text databases on local area networks and the Internet causes 
the problem of finding information that may be distributed among many disjoint text 
databases (distributed information retrieval). How to merge the results returned by selected 
databases is an important subproblem of the distributed information retrieval task. Previous 
research assumed that either resource providers cooperate to provide normalizing statistics or 
search clients download all retrie ... 
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2 Coverage, relevance, and ranking: The impact of query operators on Web search engine 100% 
results 

Caroline M. Eastman , Bernard J. Jansen 

ACM Transactions on Information Systems (TOIS) October 2003 
Volume 21 Issue 4 

Research has reported that about lO&percnt; of Web searchers utilize advanced query 
operators, with the other 90&percnt; using extremely simple queries. It is often assumed that 
the use of query operators, such as Boolean operators and phrase searching, improves the 
effectiveness of Web searching. We test this assumption by examining the effects of query 
operators on the performance of three major Web search engines. We selected one hundred 
queries from the transaction log of a Web search servic ... 
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3 Search 1 : Expert agreement and content based reranking in a meta search environment using 100% 
Pffi Mearf 

B. Uygar Oztekin , George Karypis , Vipin Kumar 

Proceedings of the eleventh international conference on World Wide Web May 2002 
Recent increase in the number of search engines on the Web and the availability of meta 
search engines that can query multiple search engines makes it important to find effective 
methods for combining results coming from different sources. In this paper we introduce 
novel methods for reranking in a meta search environment based on expert agreement and 
contents of the snippets. We also introduce an objective way of evaluating different methods 
for ranking search results that, is based upon implici ... 



4 Building efficient and effective metasearch engines 100% 
Weiyi Meng , Clement Yu , King-Lup Liu 
ACM Computing Surveys (CSUR) March 2002 
Volume 34 Issue 1 

Frequently a user's information needs are stored in the databases of multiple search engines. 
It is inconvenient and inefficient for an ordinary user to invoke multiple search engines and 
identify useful documents from the returned results. To support unified access to multiple 
search engines, a metasearch engine can be constructed. When a metasearch engine receives 
a query from a user, it invokes the underlying search engines to retrieve useful information 
for the user. Metasearch engines have ... 



5 A highly scalable and effective method for metasearch 100% 
Weiyi Meng , Zonghuan Wu , Clement Yu , Zhuogang Li 
ACM Transactions on Information Systems (TOIS) July 2001 
Volume 19 Issue 3 

A metasearch engine is a system that supports unified access to multiple local search engines. 
Database selection is one of the main challenges in building a large-scale metasearch engine. 
The problem is to efficiently and accurately determine a small number of potentially useful 
local search engines to invoke for each user query. In order to enable accurate selection, 
metadata that reflect the contents of each search engine need to be collected and used. This 
article proposes a highly scalable ... 



6 Modeling score distributions for combining the outputs of search engines 100% 
R. Manmatha , T. Rath , F. Feng 

Proceedings of the 24th annual international ACM SIGIR conference on Research and 
development in information retrieval September 2001 

In this paper the score distributions of a number of text search engines are modeled. It is 
shown empirically that the score distributions on a per query basis may be fitted using an 
exponential distribution for the set of non-relevant documents and a normal distribution for 
the set of relevant documents. Experiments show that this model fits TREC-3 and TREC-4 
data for not only probabilistic search engines like INQUERY but also vector space search 
engines like SMART for English. We have als ... 



7 Transparent Queries: investigation users' mental models of search engines 
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Jack Muramatsu , Wanda Pratt 

Proceedings of the 24th annual international ACM SIGIR conference on Research and 
development in information retrieval September 2001 

Typically, commercial Web search engines provide very little feedback to the user 
concerning how a particular query is processed and interpreted. Specifically, they apply key 
query transformations without the users knowledge. Although these transformations have a 
pronounced effect on query results, users have very few resources for recognizing their 
existence and understanding their practical importance. We conducted a user study to gain a 
better understanding of users knowledge of and reac ... 

8 Information retrieval on the web 100% 
[^j Mei Kobayashi , Koichi Takeda 

ACM Computing Surveys (CSUR) June 2000 
Volume 32 Issue 2 

In this paper we review studies of the growth of the Internet and technologies that are useful 
for information search and retrieval on the Web. We present data on the Internet from several 
different sources, e.g., current as well as projected number of users, hosts, and Web sites. 
Although numerical figures vary, overall trends cited by the sources are consistent and point 
to exponential growth in the past and in the coming decade. Hence it is not surprising that 
about 85% of Internet user ... 

9 Experiences with selecting search engines using metasearch 100% 
Pffi Daniel Dreilinger , Adele E, Howe 

ACM Transactions on Information Systems (TOIS) July 1997 
Volume 1 5 Issue 3 

Search engines are among the most useful and high-profile resources on the Internet. The 
problem of finding information on the Internet has been replaced with the problem of 
knowing where search engines are, what they are designed to retrieve, and how to use them. 
This article describes and evaluates SawySearch, a metasearch engine designed to 
intelligently select and interface with multiple remote search engines. The primary 
metasearch issue examined is the importance of carefully selecti ... 

10 Learning search engine specific query transformations for question answering 100% 
|jj Eugene Agichtein , Steve Lawrence , Luis Gravano 

Proceedings of the tenth international conference on World Wide Web April 2001 

11 Towards a highly-scalable and effective metasearch engine 100% 
1^) Zonghuan Wu , Weiyi Meng , Clement Yu , Zhuogang Li 

Proceedings of the tenth international conference on World Wide Web April 2001 

12 Web Information Retrieval: Using sampled data and regression to merge search engine 100% 
Qj results 

Luo Si , Jamie Callan 

Proceedings of the 25th annual international ACM SIGIR conference on Research and 
development in information retrieval August 2002 

This paper addresses the problem of merging results obtained from different databases and 
search engines in a distributed information retrieval environment. The prior research on this 
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problem either assumed the exchange of statistics necessary for normalizing scores 
(cooperative solutions) or is heuristic. Both approaches have disadvantages. We show that 
the problem in uncooperative environments is simpler when viewed as a component of a 
distributed IR system that uses query-based sampling to cr ... 

13 Rank-preserving two-level caching for scalable search engines 100% 
Pj| Paricia Correia Saraiva , Edleno Silva de Moura , Novio Ziviani , Wagner Meira , Rodrigo 

Fonseca , Berthier Riberio-Neto 

Proceedings of the 24th annual international ACM SIGIR conference on Research and 
development in information retrieval September 2001 

14 Architecture of a metasearch engine that supports user information needs 100% 
Plj Eric J. Glover , Steve Lawrence , William P. Birmingham , C. Lee Giles 

Proceedings of the eighth international conference on Information and knowledge 
management November 1999 

When a query is submitted to a metasearch engine, decisions are made with respect to the 
underlying search engines to be used, what modifications will be made to the query, and how 
to score the results. These decisions are typically made by considering only the user's 
keyword query, neglecting the larger information need. Users with specific needs, such as 
“research papers” or “homepages,” are not able to express these 
needs in a way that affects the decisions made b ... 

15 Rank aggregation methods for the Web 100% 
1^1 Cynthia Dwork , Ravi Kumar , Moni Naor , D. Sivakumar 

Proceedings of the tenth international conference on World Wide Web April 2001 

16 A case study in web search using TREC algorithms 100% 
Q| Amit Singhal , Marcin Kaszkiel 

Proceedings of the tenth international conference on World Wide Web April 2001 

17 Placing search in context: the concept revisited 100% 
ACM Transactions on Information Systems (TOIS) January 2002 

Volume 20 Issue 1 

Keyword -based search engines are in widespread use today as a popular means for 
Web-based information retrieval. Although such systems seem deceptively simple, a 
considerable amount of skill is required in order to satisfy non-trivial information needs. This 
paper presents a new conceptual paradigm for performing search in context, that largely 
automates the search process, providing even non-professional users with highly relevant 
results. This paradigm is implemented in practice in the Intelli ... 

18 Searching the Web 100% 
ACM Transactions on Internet Technology (TOIT) August 2001 

Volume 1 Issue 1 

We offer an overview of current Web search engine design. After introducing a generic 
search engine architecture, we examine each engine component in turn. We cover crawling, 
local Web page storage, indexing, and the use of link analysis for boosting search 
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performance. The most common design and implementation techniques for each of these 
components are presented. For this presentation we draw from the literature and from our 
own experimental search engine testbed. Emphasis is on introduci ... 



19 Best Paper: Early experiences with a 3D model search engine 100% 
Patrick Min , John A. Halderman , Michael Kazhdan , Thomas A. Funkhouser 
Proceeding of the eighth international conference on 3D web technology March 2003 
New acquisition and modeling tools make it easier to create 3D models, and affordable and 
powerful graphics hardware makes it easier to use them. As a result, the number of 3D 
models available on the web is increasing rapidly. However, it is still not as easy to find 3D 
models as it is to find, for example, text documents and images. What is needed is a \3D 
model search engine," a specialized search engine that targets 3D models. We created a 
prototype 3D model search engine to investigate the d ... 



20 Placing search in context: the concept revisited 100% 
Lev Finkelstein , Evgeniy Gabrilovich , Yossi Matias , Ehud Rivlin , Zach Solan , Gadi 
Wolfman , Eytan Ruppin 

Proceedings of the tenth international conference on World Wide Web April 2001 
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